首页> 外文OA文献 >Communication-aware MCMC method for big data applications on FPGAs
【2h】

Communication-aware MCMC method for big data applications on FPGAs

机译:用于FpGa上大数据应用的通信感知mCmC方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

© 2017 IEEE. Markov Chain Monte Carlo (MCMC) based methods have been the main tool for Bayesian Inference for some years now, and recently they find increasing applications in modern statistics and machine learning. Nevertheless, with the availability of large datasets and increasing complexity of Bayesian models, MCMC methods are becoming prohibitively expensive for real-world problems. At the heart of these methods, lies the computation of likelihood functions that requires access to all input data points in each iteration of the method. Current approaches, based on data subsampling, aim to accelerate these algorithms by reducing the number of the data points for likelihood evaluations at each MCMC iteration. However the existing work doesn't consider the properties of modern memory hierarchies, but treats the memory as one monolithic storage space. This paper proposes a communication-aware MCMC framework that takes into account the underlying performance of the memory subsystem. The framework is based on a novel subsampling algorithm that utilises an unbiased likelihood estimator based on Probability Proportional-to-Size (PPS) sampling, allowing information on the performance of the memory system to be taken into account during the sampling stage. The proposed MCMC sampler is mapped to an FPGA device and its performance is evaluated using the Bayesian logistic regression model on MNIST dataset. The proposed system achieves a 3.37x speed up over a highly optimised traditional FPGA design, therefore the risk in the estimates based on the generated samples is largely decreased.
机译:©2017 IEEE。基于马尔可夫链蒙特卡洛(MCMC)的方法已经成为贝叶斯推理的主要工具,多年来,最近在现代统计和机器学习中得到了越来越多的应用。然而,随着大型数据集的可用性和贝叶斯模型的日益复杂性,MCMC方法对于现实世界中的问题变得越来越昂贵。这些方法的核心是似然函数的计算,该函数需要在该方法的每次迭代中访问所有输入数据点。基于数据二次采样的当前方法旨在通过减少用于每次MCMC迭代的似然评估的数据点的数量来加速这些算法。但是,现有工作没有考虑现代内存层次结构的属性,而是将内存视为一个整体存储空间。本文提出了一种通信感知的MCMC框架,该框架考虑了内存子系统的基本性能。该框架基于一种新颖的子采样算法,该算法利用了基于概率比例大小(PPS)采样的无偏似然估计器,从而允许在采样阶段考虑有关存储系统性能的信息。拟议的MCMC采样器映射到FPGA器件,并使用MNIST数据集上的贝叶斯逻辑回归模型评估其性能。与高度优化的传统FPGA设计相比,拟议的系统可达到3.37倍的速度,因此,基于生成的样本进行估算的风险大大降低了。

著录项

  • 作者

    Liu, S; Bouganis, CS;

  • 作者单位
  • 年度 2017
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号